Experiments on the zero frequency problemJohn
نویسندگان
چکیده
1 Introduction The best algorithms for lossless compression of text are those which adapt to the text being compressed 1]. Two classes of such adaptive techniques are commonly used. One class matches the text against a dictionary of strings seen and transforms the text into a list of indices into the dictionary. These techniques are usually formulated as a variant on Ziv-Lempel (LZ) compression. While LZ compressors do not give the best compression they are widely used because of their simplicity and low execution overhead. The best compression is obtained by another class of compressors which use adaptive statistical modelling. These split compression into two steps. The rst step accumulates a statistical model of the characters seen so far in the input text. As each character is encoded this model is used to generate a probability distribution over those characters which can occur next. Arithmetic coding is then used to optimally encode the character which actually does occur with respect to this distribution. The best compression has been obtained from a series of variants of PPM modelling 1]. PPM models are built up by counting the characters that have occurred following contexts of prior characters. For example, all the characters followingà' are recorded. The next timèa' occurs the counts associated with it are used to generate the probability distribution for the following character. The PPM techniques blend together the predictions from contexts of varying lengths to arrive at an overall probability distribution. For practical reasons of memory usage and execution time most PPM variants x an upper bound to the lengths of the contexts, although recently a variant which uses unbounded length contexts has been very successful 2]. The focus of this paper is the problem of transforming the set of counts accumulated for a particular context into a probability distribution. To simplify our discussion and later experiments we will focus on the case when the alphabet of characters is binary with just two symbols: 0 and 1. Now in a statistical model each context will deliver two counts: C 0 , the number of times a 0 has occurred, and C 1 , the number of times a 1 has occurred. A naive estimate of the probability of character i could be obtained by the ratio
منابع مشابه
On Interaction of T S Waves and 3 D Localized Disturbance in a Divergent Flow Under Zero Pressure Gradient
To simulate the effect of free st ream turbulence on turbulent spot formation, experiments were conducted on the interaction of localized three-dimensional disturbances with the harmonic waves in a laminar boundary layer on a flat plate. Experiments conducted in three-dimensional diverging flow (but zero pressure gradient) show, while individually the disturbances decay downstream, their intera...
متن کاملCompensation of Doppler Effect in Direct Acquisition of Global Positioning System using Segmented Zero Padding
Because of the very high chip rate of global positioning system (GPS), P-code acquisition at GPS receiver will be challenging. A variety of methods for increasing the probability of detection and reducing the average time of acquisition have been provided, among which the method of Zero Padding (ZP) is the most essential and the most widely used. The method using the Fast Fourier Transform (FFT...
متن کاملExperimental Investigation of the Effect of Splitter Plate Angle on the Under-Scouring of Submarine Pipeline Due to Steady Current and Clear Water Condition
Submarine pipelines are appropriate method for transmission of oil and gas from sea bed. Free spans may occur due to the natural uneven seabed or by under-scouring. Vortex Induced Vibration (VIV) can happen in such free spans at high Reynolds number. Resonance occurs if the frequency of vortex shedding is close to the pipeline’s natural frequency leading to its fatigue that can break the pipeli...
متن کاملAnalysis and Simulation of ZVS Methods in Full Bridge Converters and Realization of a 3 KW Prototype
One of the difficulties with PWM switching converters is high switching loss and electromagnetic interference due to switching at non-zero voltage and current, which limits the operating frequency. In order to reduce the converter volume and weight (by increasing the frequency) and reducing switching losses, zero voltage and current switching methods are recommended. In this paper, four main ze...
متن کاملAnalysis and Simulation of ZVS Methods in Full Bridge Converters and Realization of a 3 KW Prototype
One of the difficulties with PWM switching converters is high switching loss and electromagnetic interference due to switching at non-zero voltage and current, which limits the operating frequency. In order to reduce the converter volume and weight (by increasing the frequency) and reducing switching losses, zero voltage and current switching methods are recommended. In this paper, four main ze...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995